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A PROCESS FOR IDENTIFYING PHARMACEUTICAL!^ ACTIVE 
AGENTS USING AN EPITOPE-TAGGED LIBRARY 

The present invention relates tio an in^proved process ITgt identifying 
phasmsceutically active ©gents. 



10 BACKGROUND OF TOE INVENTION 

Over the past ten years, there has been a growing demand for the production 
and identification of agents that have pharmacological activity m 0 for example, 
agonists or antagonists of various cellular ©eceptar molecules, such as cell-surface 
receptors, enzymes csr antibodies. In the continuing search fosr new chemical 

IS moieties that can effectively modulate a variety of biological processes, the standard 
method for conducting a search is to screen a variety of pre-existing chemical 
moieties, for example, naturally occurring compounds or compounds which exist in 
synthetic libraries ©a* databanks. The biological activity of the pre-existing chemical 
moieties is determined by applying the moieties to an assay which has been 

20 designed to test a particular property of the chemical moiety being screened, for 
example, a receptor binding assay which tests the ability of the moiety to bind to a 
particular receptor site. 

In an effort to reduce the time and expense involved in screening a large 
number of randomly chosen compounds for biological activity, several 

25 developments have been made to provide libraries of compounds for the discovery 
of lead compounds. The chemical generation of molecular diversity has become a 
major tool in the search for novel lead structures* Currently, the known methods for 
chemically generating large numbers of moleculariy diverse compounds generally 
involve the use of solid phase synthesis, in particular to synthesize and identify 

30 peptides and peptide libraries. See, for example, Lebl et al., fns. J. Pept. Pros. Res., 
41, p. 201 (1993) which discloses methodologies providing selectively cleavable 
linkers between peptide and resin such that a certain amount of peptide can be 
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liberated from the resin and assayed in soluble form while some of the peptide still 
remains attached to the resin, where it can be sequenced; Lam eft aL, Nature* 3§3, p. 
82 (1991) and (WO 92/00091) which disclose a method of synthesis of linear 
peptides on a solid support such as polystyrene oypolyacsykmide resin; Geysen et 
5 al., /. Immunol. Mesh., 2102, p. 259 (1987) which discloses the synthesis of peptides 
on derivatized polystyrene pins which are arranged on a block in sudh a way that 
they correspond to the arrangement of wells in a 96-well microtitter plate; and 
Houghten et al., Nature, p. 84 (1991) and WO 92/093C0 which discJose an 
approach to de novo determination of antibody or receptor binding sequences 

10 involving soluble peptide pools. 

Nonpeptidic organic compounds, such as peptide mimetics, can often surpass 
peptide ligands in affinity for a certain receptor of enzyme. An effective strategy 
for rapidly identifying high affinity biological ligands, and ultimately new and 
important drugs, requires rapid construction and screening of diverse libraries of 

15 non-peptidic structures containing a variety of structural units capable of 

establishing one or moire types of interactions with a biological acceptor (e.g. p a 
receptor or enzyme), such as hydrogen bonds, salt bridges, pi-complexation, 
hydrophobic effects, etc. However, wortc on the generation and screening of 
synthetic test compound libraries containing nonpeptidic molecules is now in its 

20 infancy. One example from this area is the worts of Ellm&n and Bunin on a 

combinatorial synthesis of benzodiazepines on a solid support (J. Am. Chem. Sec. 
1 14, 10997, (1992); see Qbtarcd^^ January 18, 1993, page 

33). 

Historically, the process.to identify pharmacologically active agents has been 
25 characterized as time-consuming, labor-intensive and inefficient and usually 

involves a single biological target per screening effort. Further, efforts to improve 
the process have focused entirely of increasing the number and quality of agents per 
screening effort. Until now there has been little if any emphasis on improving the 
efficiency with which biological targets are utilized in screening ass&ys. 
30 Additionally, efforts are underway to fully characterize the human genome. 

This initiative has already produced thousands of novel gene sequences, many of 
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which are of unknown function. To identify which of these genes present 
opportunities for pharmaceutical intervention would take years of diligent effort using 
current methodology. Presently there is a need for unproved methods for efficiently 
identifying phaamaceutically active agents and for better utilisation of information 
5 resulting from the human genome initiative 

Many of the disadvantages of the known methods as well as many of the 
needs nos meg by them are addressed by the present invention which, as described 
more fully hereinafter, provides numerous advantages over the known methods. 

10 SUMMARY QF THE KNVENTIQW 

This invention relates to a process fos- identifying pharm&ceutically active 
agents which comprises simultaneously expressing a plurality of Uniquely Tagged 
Target Agents to form a Target library; preparing an Agent Candidate Pool and 
testing the Target Literary and the Agent Candidate Fool in an assay which identifies 
15 agents of the candidate pool having desired characteristics and to pharmaceutical^ 
active agents identified by such process. 

This invention also relates to a process for simultaneously expressing a 
plurality of uniquely tagged genes to foam a Target Literary. 

This invention also relates to © process for simultaneously expressing a 
20 plurality of uniquely tagged gene products to foam a Target Library. 

This invention also relates to a process for simultaneously expressing a 
plurality of uniquely tagged membrane-bound receptors to form a Target Library* 

This invention also relates to a Target Library comprising a plurality of 
expressed uniquely tagged genes. 
23 This invention also relates to a Target library comprising a plurality of 

expressed uniquely tagged gene products* 

This invention also relates to a Target library comprising a plurality of 
expressed uniquely tagged memteane-bound receptors. 

This invention also relates to Spatially Encoded Targes Libraries. 
30 This invention also relates to methods of screening for pharmaceutical activity 

utilizing Spatially Encoded Target Libraries. 
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. DETAILED DESCRIPTION OP TODS INVENTION 
As used herein, she term "Agent Candidate Bool" means a sowsce of one or 
more entities selected front the group consisting of: chemical compounds, sush as 
5 mixtures of individual compounds, peptides, natural products and moaoctonal 
antibodies which are go be tested for pharmaceutical activity. These entities are 
optionally tested as combinatorial compound Kbrsries, combinatorial peptide libraries 
c? varioastars. 

As (used herein, the term 'Target library* means a plurality of expressed 
1 0 Uniquely Tagged Target Agents* 

As used herein, the term Uniquely Tagged Target Agent" means uniquely 
tagged genes, uniquely tagged gene products or uniquely tagged membrane-bound 
receptors. 

As used herein, the term "gene products" means a protein produced by the 

IS expression of a gene. For example, an enzyme, a receptee or a docking protein such 
as an SH2 domain. 

As used herein, the term "uniquely tagged gene($)" means that, prior to 
expression, each gene that will make up die Target library is engineered with si 
distinct marker such as a short peptide sequence (hereinafter epitope tag) which is 

20 capable of recognition by specified antibodies. Said epitope nags can be 

manufactured in large number. Additionally, several genes can be grouped with the 
same tag, subgrouped with different tag combinations or tandem tags can be 
assembled to allow sequential selection. 

As used herein, the term "uniquely tagged gene produces)" means that, prior 

25 to expression, each gene product that will make up the Target Library is engineered 
with a distinct marker such as a short peptide sequence (hereinafter epitope tag) 
which is capable of recognition by specified antibodies. Said epitope tags can be 
manufactured in large number. Additionally, several gene products can be grouped 
with the same tag, subgrouped with different tag combinations or tandem tags can be 

30 assembled to allow sequential selection. 
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As used herein, the term "uniquely sagged memtesne-bcaand receptors)" 
means that, prior to expression, each receptor that will make asp the Target Oteary is 
engineered with a distinct* mazte such as a shaft peptide sequence (hes^eimifter 
epitope Sag), which is capable of FscognMon by verified antibodies, a?fcd Ghera 
5 incorporated into suitable recipient cells. Said epitope tags cm fee msnufscms^d in 
krge number. Additionally, several raoeptors can be grouped with she some t&g, 
subgroups with different tag combinations or saftdsaxa tags eon be assembled Go 
alEow sequential selection, Inoorpcssition of o@© tastes reospsa? ttyps pes* cell is 
preferred, as this mil enable adjustment of the relative composition of the Target 

10 library by mixing together different numbers of cells. This cell mixture will 
constitute the Target Library in this example, 

Many receptors require a cell membrane for proper binding and functional 
responses to ligands, for example 7 transmembrane G-protein coupled receptors. As 
used herein, receptors which require a cell membrane for proper binding and 

IS functional response and which ore incorporated into a cell are referred to as 
"membrane-bound receptors". Generally, applying Target likrasy screening 
approaches to membrane-bound types of receptor can be perforated as indicated 
herein. A preferred method of screening membrane-bound receptors is to use one or 
more chemical compounds or a combinatorial chemical library as the Agent 

20 Candidate Pool and provide a photoactivatable group in the template during the 
synthesis of the compounds or combinatorial library. Examples of such groups are 
aryl azides or benzophenones. Additionally, a readily detectable master such as 
biotin is preferably incorporated into the template. Thus, esch compound will 
preferably carry both the photoactivatable and detection groups. Alternatively, if 

23 the ligands for the targets are known, the photoactivatable and detection tags need 
only be incorporated into the ligands, and unlabelled compounds can be screened by 
competition. In this preferred example of the invention the compound os* 
combinatorial library is mixed with the Target Library and irradiated to activate the 
photoactivatable group. The cells of the Target library are lysed and the mixture is 

30 added to an array of spatially encoded antibodies raised against the epitope tags of 
the Target Library. The matrix of antibodies is probed with reagents bo detect the 
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marker (for example: streptavidin-peroxidase to detect biodn). Hie spectrum of 
gargets that interact with o given compound or group of compounds can be identified 
by association of the detection marker with a particular subset of epitope tags. If 
necessaxy, the Target Library can be pre-screened for incorporation of the detection 
5 marker' prior to deconvolution on the antibody matrix. 

As used herein, the v&m "simultaneously expressed" means that num@ous 
Uniquely Tagged Target Agents are expressed ©nd purified together without regasds 
to maintaining the identity of the subject gene, gene product <rcr memfcrane-toiond 
receptor. 

JO As used herein, the terns "assay", "screen* cr "testing for biological activity" 

includes any form of testing for pharmaceutical^ relevant activity. 

In preparing the Target Library, one may initially begin with any manageable 
number of genes, preferably less than one hundred. Preferably these genes are 
grouped for example by expression in tissues of interest or by membership in a 

15 common family, such as kinases, based on sequence homology. Each gene is 
uniquely tagged preferably with an epitope tag and simultaneously expressed. 

In preparing the Target Library, one may initially begin with any manageable 
number of gene products, preferably less than one hundred. Preferably these genes 
are grouped for example by expression in tissues of interest cur by membership Sn a 

20 common family, such as SH2 domains* based cai sequence homology. Each gene 
product is uniquely tagged preferably with an epitope tag and simultaneously 
expressed. 

In preparing the Target Library, one may initially begin with any manageable 
number of receptors, preferably less than one hundred. Preferably these receptees are 
25 grouped for example by expression in tissues of interest or by membership in a 
common family, such as 7 transmembrane G-protein coupled receptors, based on 
sequence homology. Each receptor is uniquely tagged preferably with an epitope tag, 
incorporated into a cell and simultaneously expressed. 

To identify compounds that perturb the function of the targets in the Target 
30 Library, an Agent Candidate Pool, preferably one or more chemical compounds or a 
combinatorial compound library, is prepared by standard methods. Active agents are 
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identified by known essay techniques or by engineering a selection steg> b©sed ©a t&e 
anticipated activities contained in the Target librsry (as described in the Examples 
below). Beconvolution of the Target Library can be ©ccomplish^ 
epitope tag, white analytical methods such as warns spectral techniques, cm be used 
5 for Agent Candidate Bool analysis. Thus she Mentific&stea of cctiv© agoits and eh© 
Junction of the Uniquely Tagged T^-get Ageiaa onslyzed to psnlM. 

Fharroaceutically active agents identified by tifoe above process ase optionally 
screened in cell based assays for a function of in&srest, followed by m opticmofl 
functional screen or animal model. Also included within the esqjse to the pressac 

10 invention are pharmaceutical^ active agents identified by the processes disclosed 
herein, While is not necessary to identify the specific targes (gea^ gene product cs- 
receptor) affected, if toxicity problems arise this information could be potentially 
useful and could readily be obtained retrospectively. Further 1 , parallelism is 
maintained in the secondary screens, with multiple compounds being simultaneously 

IS evaluated in a battery of cell-based assays. 

A particularly advantageous aspect of this invention is the ability to maintain 
parallelism in all phases of the discovery process Le. target selection, expression mud 
purification, compound synthesis, primary and seocmdasy screening* Additional 
advantages of this process include: the ability fc? multiple targets to be 

20 simultaneously evaluated without prior knowledge of function osr disease association, 
built-in selectivity profiling, greater chance of uncovering totally unexpected activities 
and disease indications, increased efficiency by utilizing more drug candidates per 
unit time. 

Within the general framework of the parallel approach described above, 
25 numerous methodologies for screening Target Libraries are possible. One such 
methodology is described in Example 1. All such methodologies are within the 
scope the invention as claimed herein. 

In a further aspect of the invention there is provided a preferred method for 
screening Target Libraries which utilizes expressed Uniquely Tagged Target 
30 Agents, wherein the tagging marker is an epitope tag, spatially encoded in discrete 
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areas of a substrate (referred to herein as Spatially Encoded Target Library or 
SETL). Spatially Encoded Target Libraries for use herein are prepared by: 

i) depositing groupings of antibodies, each grouping being directed against a 
different epitope tag of the Target Library and each epitope tag of the Target 

5 library corresponding to as least one antibody grouping, in ora array (hereinafter 
antibody region), preferably in a mosaic pattern, on a substrate, preferably a 
polyvinylidene difluoside (PVDF) membrane, a nitrocellulose membrane, n 
polystyrene laye? or a siliooaa wafer, - 

ii) blocking the antibody region of the membrane with an inert protein and 
10 iii) exposing the Target Library to the andbody region. 

Andbody groupings will selectively bind to their corresponding epitope tags, 
thereinby localizing like groups of Uniquely Tagged Target Agents to a specific 
area of the membrane. The above process is repeated, as desired, in order to 
produce a plurality of functionally similar SETL& 

15 The deposition of the antibodies to the membrane can be accomplished by 

any means, for example manually spotting the membrane with a capillary pipette, or 
in an automated fashion by "micro-spotting' in small predetermined deposit areas, for 
example by the use of printing techniques such as those used in ink jet printers. The 
membrane carrying the immobilized antibodies is preferably bonded to a vessel, 

20 such as Merrifield synthesis vessels, flasks, or preferably microti ter 96-well plates, 
prior to step ii above, such that each vessel contains an entire antibody region. 

The Agent Candidate Pool is portioned into the vessels along with other 
assay reagents. Following a detection step, the membrane is imaged to identify 
activity of the Uniquely Tagged Target Agents of the Target Library and the effects 

23 of the Agent Candidates. 

The SETL approach is advantageously used in developing screening assays 
for receptors or other binding proteins, for example SH2 domains, as shown in 
Examples 2, 3 and 4 below. 

Without further elaboration, it is believed that one skilled in the art can, 

30 using the preceding description, utilize the present invention to its fullest extent. 
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The following Examples are, therefore, to be construed as merely illustrative and 
am a limitation of the scope of the present invention in any way. 

The cell type responsible for new bone production is the osteoblast. Ac o 
result of the application of large scale sequencing techniques to osteoblast dDNA 
libraries, a number of novel full length kinases have been identifbdo A nunab^ of 
full length kinases from osteoblast-li&e cells each uniquely tagged with one <srr 

10 more epitope tags. For example, a unique identifier tag can be attached in tandem 
with a common purification tag to facilitate the purification process* The genes are 
gagged and simultaneously expressed in an acceptable host, e.g. E. Coli car 
baculovirus. The tags are used to simultaneously purify all members of the Mnase 
Target Library as a mixture, which is characterized by Western blotting with 

15 antibodies to the unique tags. Suitable substrates are identified by sepas^telly 

immobilising each member of the Target litawy via its epitope flag and adding a 
mixture of phosphate acceptors along with iso&opically labelled! ATP. 
Phosphorylated substrates are identified by mass spectrometric techniques. 

A compound combinatorial library is synthesized by standard methods. The 
20 Target Library is spatially encoded (as described above) along with suitable 

substrates. The latter are synthesized so that they contain both a phosphate acceptor 
and an epitope tag to co-localize them with the relevant kinase targets. Test 
compounds are added to each array, along with isotopically labelled ATP. The 
arrays are imaged to identify areas where phosphate has been incospomted. 

25 Active agents from Example 1 are optionally screened in a secondary assay, 

such as an osteocalcin release assay which correlates with bone formation. 
Additionally, active agents from Example 1 are screened in a battery of cell based 
assays covering functions unrelated to bone formation. Preferably, said cell based 
assay utilizes reporter gene technology, for example - as described in US patent 

30 numbers 5,401,629 and 5,436,128. Such technology utilizes the promoter regions 
of various genes of interest e.g. osteocalcin, coupled to a reporter gene e.g. 
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lucif erase, and transfected into an appropriate cell line. When this is done m a 
multiposent cell line e.g. embryonic stem cells, the same cells may be used with 
different reposter constructs so simultaneously assay for a variety of functiom 
(hereinafter 'assay library 9 ). Thus the parallel approach of the paressnt invention is 
5 maintained all die way to the secondary assay, giving advantages of efficiency and 
wide profiling of active compounds. 

Preferably the active agents are then evaluated an animal models in vifiso 
functional assays and screened for toxicity. 

The SETL, approach could be used to configure a screening assay fosr 
receptors or other binding proteins, such as SH2 domains. SH2 domains bind 
phosphorylaied peptide ligands. To configure a SETL assay for SH2 domains, the 
domains are expressed as epitope tagged fusion proteins. Antibodies Go the tags are 

IS deposited on membranes as above. A mixture of tagged SH2 domains is added to 
each well and allowed to bind to the corresponding antibodies. A mixture of 
biotinylated phosphopeptide ligands (one for each SH2 target) is added along with 
test compounds. Binding of the biotinylated peptide ligands as detected by addition 
of streptavidin-biotin complex and colorimetric, fluorescence or enhanced 

20 chemiluminescence (EGL) detection reagents. Imaging of the activity of the 

detection reagents on the membrane surface allows binding of the Mgand to each 
individual SH2 domain to be measured. Such imaging could be automated. 

A working demonstration of this assay was accomplished by expressing the 
human src SH2 domain as a fusibn protein. 

25 Hie fusion protein containing the human ssc SH2 domain was expressed as 

the general sequence: DETl-DET2-spacer-ek-src SH2, where DET1, BET2, spacer 
and ek are as described below. DET1 ("defined epitope tag 1") (SEQ ID NO: 1) is 
an 1 1 amino acid sequence found in the Human Immunodeficiency Virus Type 1 
(HIV-1) envelope protein gp!20 (or gpl60). Monoclonal antibodies to various 

30 epitopes of HIV-1 gpl20 (or gp!60) are known in the art, see, for example U.S. 
Patent 5,166,050. One preferred example is monoclonal antibody 178.1 (see, e.g., 
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Thiriart et al., XJtrommolL Ml:1832-1836 (1989)). which was prepared by 
immunisation of mice with o yeast-expressed HIV-1 gplfiO molomte Srom strain 
BH10 (Ratner et aL 0 Nafrre. 313:277-284 (1985)). This tag was used for detection 
of expression (by Western Mm), for p*srificsticft of the proflgfoa (by aMsa&y 
5 chromatography), and for configuring assays kt wMdh the fesao^D protein was 
captured 07 immobilized using the 178.1 antibody. DET2 is d tesEa-htoidte 
sequence sag (SEQ XD NO: 2) which binds tto akkeE-mataimag sqsbbbs oxcd was aased 
for purification proposes. Spacer (SEQ ID NO: 3) was uffissd m dssig@ a BsmHl 
restriction site at the indicated positions of the construct. The -e&- refera to a 

10 recognition sequence (SEQ ID NO: 4) for the enteroMnase protease which provides 
for the optional removal of the tags from the &c SH2 domain, thus producing a sc 
SH2 domain that contains no extraneous amino acids. A src SH2 domain which 
contains no extraneous amino acids are preferable to tagged protein for 
crystallography studies. 

1 5 The DNA sequence encoding each DETl-DET2-spacer-ek-sro SB2 was 

designed such that the indicated restriction sites (BamHl and Xbsl) flank she spacer- 
ek-src SH2 region, thereby allowing different spaesr-ek-SH2 contracts to be readily 
substituted into any one of the vectors as described in Procedure 2 below to create a 
DETl-DET2-spacer-e!e-SH2 tagged protein. The DNA sequence encoding the 

20 DET 1 -DET2- spacer-ek- sre SH2 construct was also designed such that the entire 
tagged SH2 domain can be moved as an Ndel-Xbal fragment into any expression 
vector containing an Ndel site at an appropriate distance downstream of £L 
transcription and translation regulatory sequences and a downstream cloning site 
compatible with Xbal. Although any suitable vector would yield similar resu!ts(e.g., 

23 pET-1 la; Novagen, Inc.), the vector used in the instant experiments was 2L apH 

expression vector pEAlKnRBS3. This vector is a derivative of the series of vectors 
described in Shatzman, A, Gross, M 9 and Rosenberg, M, 1950, "Expression using 
vectors with phage lambda regulatory sequences", In: Cunent Protocols in Molecular 
Biology (F.A. Ausubel et al , eds.), pp. 16.3.1-16.3.11, Greene Publishing and Wiley- 

30 Imerscience, N.Y. (hereinafter F.A. Ausubel etal.). The specific vector 

pEAlKnRBS3 is described in Bergsma et al, 1991, J- Biol. Chero. 26&23204-23214. 
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The procedures below describe the expression of chicken sic and the human 
sic SH2 domains. First, the chicken src SH2 domain was expressed as DET1- 
DET2-spacer-SH2. Then, the other was inserted into this vector in place of chicken 
src to express proteins in the form DETl-DET2~spacer*ek-sic SH2 as described in 
5 procedures 1 and 2 below. 

Procedure 1: Cloning and Expression of chicken sic SH2 domain containing tags 
DET1 and DET2 (DETl-DET2-spacer-SH2)» 

A DNA sequence encoding the tagged protein DETl~DET2-spacer-SH2 was 
10 PGR amplified from a cDNA clone containing the chicken sic gene (pSH; Levy et al 
1986. Proc- Natl. Acad. Sri. USA 83:42281 by methods well known to those skilled 
in the art by using the following primers: 

5' 

15 TTC£AIAI£AAAAGTATTXX3TA 

O^tXACX^CXXKiATCCCro 3' (SEQ ID NO: 7) 

The underlined sites are an Ndel recognition site (5*) and a BamHI 
recognition site (3*). 

20 

5 f OGAATICIAfiATTACTAGGAOGTGGGGCAGAOGTT 3' (SEQ ID NO: 8) 
The underlined region is an Xbal recognition site. 

The PGR product was digested with Ndel and Xbal, followed by isolation of 
25 the digested fragment on an agarose gel. The fragment was ligated into Ndel-Xbal- 
digested pEA!KnRBS3 vector (Bergsma et al, supral that had been agarose gel 
purified as a 6.5 kbp fragment The ligation reaction was used to transform E* coH 
MM294cI* (F.A. Ausubel et al., smaa). A plasmid containing an insertion of the 
correct fragment was identified and confirmed by DNA sequencing. The resultant 
30 plasmid encodes DET 1 -DET2-spacer- SH2 under the control of the phage laroda P L 
promoter and regulatory system. Plasmid DNA was purified from MM294cF and 
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used to transform 2L coli strain AR120. In this host strain, expression of the phage 
promoter can be induced by addition of nalidixic acid to the growing culture as 
described in F.A. Ausubel et al, supra- Nalidixic acid induction of AR120 
containing this plasmid, followed by analysis of the cellular proteins on an SDS- 
5 polyacrylamide gel stained with Coomassie Blue (RA. Ausubel et aL v supraV 
resulted in appearance of a protein band with an apparent nx>lecular weight of 
15,000; this band was not seen in uninduced cells or in induced cells containing 
pEAlKnRBS3 lacking the PCR amplified fragment Western blotting confirmed 
that the induced protein band reacted with the anti-DETl monoclonal antibody 
10 178.1. 

Procedure 2 : Cloning, expression and purification of human sic SH2 domain 
containing tags and an enteroltinase proteolytic cleavage site (DET1-DET2- spacer- 
ek-src SH2). 

IS A DNA sequence encoding protein ek-src SH2 was PCR amplified from a 

cDNA clone containing the human src gene (c-src SH2 DNA sequence identical to 
that described in TakeyaX and Hanafusa, H, 1983 Cell 32:881-890) using the 
following primers: 

20 5' C GGGATCC TGGACGACXiACGAC^ 3' 
(SEQ ID NO: 9) 

The underlined site is a BamHI recognition site. 

25 5* GGAA TTCTAGA CTATTAGGACGTGGGGCACACGGT 3' (SEQ ID NO: 10) 

The underlined region is an Xbal recognition site. 

The PCR product was digested with BamHI and Xbal, followed by isolation 
30 of the digested fragment on an agarose gel. The fragment was ligated into BamHI- 
Xbal-digested expression vector containing the tagged chicken src gene DET1- 
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DET2-spacer-SH2 described in Procedure J above. In shas vector, she BamHX si«e 
is located between she coding regions for DET2 and SH2, and she Xbal site is 
located after the 3 9 end of the SH2 coding region. The liga£soft reaction was used so 
transform & figg MM294cF. The construct DETl-BETZ-spacer^-sra SH2 was 
5 confirmed by DNA sequencing (SEQ ID NO: S) and induced 2a JSL /saU straxa 
AR120 as described in Procedure 1 above. A CBomas$ie-Bl^-sGaiEed 0 Wes&sfa- 
blot-positive induced protein band with an apparent molecule weight of 36,000 was 
observed oUta* raalMmc acid induction 

Cells wese Jysed as neutral pH by soaicsakwi in the pre^mo© of lysosyasze. 
10 After centrifugation, the soluble extract was chromatography on a Ni~NTA colaxsnn. 
After washing the column with equilibration buffer (Tris buffer pH 8 containing 0.5 
M Nad) and the same buffer containing IS mM imkSaaolle, the prosein, DET1- 
BET2-space8--ek-sro SH2, was eluted in highly purified foma with 23 mM imidazole 
in equilibration buffer. 

15 After manually spotting Mab 173.1 on a PVDF membrane and addition of 

tagged src SH2 domain, probing with a biotinylated phosphopeptide ligand fc? src 
SH2 (SEQ ID NO: 6), followed by ECL «kt colorimetric detection showed a clear 
signal. No signal was seen with an equivalent amount of irrelevant antibody. 
Binding of the biotinylated phosphopeptide to the immobilized sro SH2 domain was 

20 inhibited by addition of a non-biotinylated phosphopeptide competitor of identical 
amino acid sequence to the biotinylated molecule. 

Example 3: Protease assay. 

As with the kinase assay D it is necessary to identify she optimum substrates 
25 for the target library components before configuring the SETL scraen. Once this is 
known, substrates are synthesized with the appropriate tag on one side of she 
cleavage site and a detection label such as biotin on the other. Once immobilised, 
protease activity can be identified by loss of the label from the target site. 

30 Example 4: Biopanning method 

Screening of bead-based compound libraries with multiple targets. 
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Each gene of a Target Library is expressed with two epitope tags: a common 
tag C which is the same for all genes and a unique tag T which is different for each 
gene. A combinatorial library is synthesized on beads by standard methods. The 
combinatorial library is screened with the Target Library by measurement of the 
5 association of tag C with the beads and separation of individual positive beads either 
manually or by automated methods. Following separation, the proteins are eluted 
from each positive bead* for example by low pH washing, and allowed to bind to an 
array of spatially encoded antibodies against the unique tags T. The location, thus 
identification, of the spectrum of proteins from the positive beads is determined by 

10 probing the matrix with an antibody against C, such probe being labeled with a 
readily detectable marker such as biotin, with subsequent readout Further, active 
compounds are identified by cleavage from the positive beads with subsequent 
chemical/physical analysis. 

While the preferred embodiments of the invention are illustrated by the 

IS above, it is to be understood that the invention is not limited to the precise 

instructions herein disclosed and that the right to all modifications coming within 
the scope of the following claims is reserved. 



-15- 



WO 96/24847 



PCTYUS96/02490 



SEQUENCE LISTING 

5 (1) GENERAL INFORMATION 

(i) APPLICANT: DUNNINGTON, DAMIEN 

<ii) TITLE OF THE INVENTION: A PROCESS FOR IDENTIFYING 
10 PHARMACEUTICALLY ACTIVE AGENTS 

(ill) NUMBER OF SEQUENCES: 10 

(iv) CORRESPONDENCE ADDRESS: 

IS (A) ADDRESSEE: SmithKline Beecham Corporation 

(B) STREET: 709 Swede land Road 

(C) CITY: King of Prussia 

(D) STATE: PA 

(E) COUNTRY : USA 
20 (F) ZIP : 19406 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 
25 (C) OPERATING SYSTEM: DOS 

<D) SOFTWARE: FastSEQ Version 1.5 

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 

30 (B) FILING DATE: 

(C) CLASSIFICATION: 

(vil) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/497,357 
35 (B) FILING DATE: 30-Jun-1995 



(viii) ATTORNEY /AGENT INFORMATION: 
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10 



25 



(A) NAME: Dustman/ Wayne J 

(B) REGISTRATION NUMBER: 33,870 

(C) REFERENCE /DOCKET NUMBER: P50323-3 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 610-270-5023 

(B) TELEFAX: 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:li 



<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 11 amino acids 
IS (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<li) MOLECULE TYPE: peptide 
20 <iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l; 



Lys Ser lie Arg lie Gin Arg G^y Pro Gly Arg 
15 10 

30 

(2) INFORMATION FOR SEQ ID NO:2: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: € amino acids 
35 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(11) MOLECULE TYPE: peptide 
(ill) HYPOTHETICAL: NO 
(lv) ANT I SENSE: NO 
5 (v) FRAGMENT TYPE: internal 

(vl) ORIGINAL SOURCE: 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO:2: 



10 Hla His His His His His 
1 5 



(2) INFORMATION FOR SEQ ID NO: 3: 



IS (1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 amino acids 
(8) TYPE: amino acid 

(C) STRANDEDNES5: single 

(D) TOPOLOGY : linear 

20 

(11) MOLECULE TYPE: peptide 
(ill) HYPOTHETICAL: NO 
(lv) ANT I SENSE: NO 
(v) FRAGMENT TYPE: internal 
25 (vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



Gly lie Leu 
30 l 



(2) INFORMATION FOR SEQ ID NO: 4: 



(1) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: S amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
5 (iv) ANT I SENSE : NO 

(v) FRAQ4ENT TYPE : internal 
Cvi) ORIGINAL SOURCE: 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

10 

Asp Asp Asp Asp LyS 
1 5 

(2) INFORMATION FOR SEQ ID NO: 5: 

15 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 130 amino acids 
<B) TYPE: amino acid 
<C) STRAND EDNESS : single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

25 (v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

30 Met Lys Ser lie Arg lie Gin Arg Gly Pro Gly Arg His His His His 
15 10 15 

His His Gly lie Leu Asp Asp Asp Asp Lys Ala Glu Glu Trp Tyr Phe 

20 25 30 

Gly Lys lie Thr Arg Arg Glu Ser Glu Arg Leu Leu Leu Asn Ala Glu 
35 35 40 45 

Asn Pro Arg Gly Thr Phe Leu Val Arg Glu Ser Glu Thr Thr Lys Gly 
50 55 60 
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Ala Tyr Cys Leu Ser Val Ser Asp Phe Asp Asn Ala Lys Gly Leu Asn 
65 70 75 80 

Val Lys His Tyr Lys lie Arg Lys Leu Asp Ser Gly Gly Phe Tyr lie 
85 90 95 

5 Thr Ser Arg Thr Gin Phe Asn Ser Leu Gin Gin Leu Val Ala Tyr Tyr 
100 105 110 

Ser Lys His Ala Asp Gly Leu Cys His Arg Leu Thr Thr Val Cys Pro 
115 120 125 

Thr Ser 
10 130 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i> SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 <li> MOLECULE TYPE: peptide 

(ill) HYPOTHETICAL: NO 

<iv> ANT I SENSE: NO 

<v) FRAO<ENT TYPE: internal 

<vi) ORIGINAL SOURCE: 
25 <lx) FEATURE: 

(A) NAME /KEY: Other 

(B) LOCATION: 4 % ..4 

(D) OTHER INFORMATION: phosphorylated tyrosine r 
30 esldue 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Glu Pro Gin Tyr Glu Glu lie Pro lie Tyr Leu 
35 1 5 10 15 
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(2) INFORMATION FOR SCO 10 NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 87 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(11) MOLECULE TYPE: CDNA 
10 (ill) HYPOTHETICAL: NO 

<iv) ANT I SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

IS <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



TTCCATATGA AAAGTATTCG TATTCAGCGT GGCCCGGGCC GTCACCACCA CCACCACCAC 
60 

GGGATCCCCG CTGAAGAGTG GTACTTT 
20 87 



(2) INFORMATION FOR SEQ ID NO: 8: 



(1) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: cDNA 

(ill) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
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GGAATTCTAG ATTACTAGGA CGTGGGGCAG ACGTT 
38 



(2) INFORMATION FOR SEQ ID NO: 9: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: Single 

10 <D> TOPOLOGY : linear 



(11) MOLECULE TYPE: cDNA 
(ill) HYPOTHETICAL: NO 
(iv) ANT I SENSE : NO 
IS (v) FRAGMENT TYPE: 

(Vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



20 CGGGATCCTG GACGACGACG ACAAAGCTGA GGAGTGGTAT TTT 
46 



(2) INFORMATION FOR SEQ ID NO: 10: 



25 (1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: , single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: cDNA 
(ill) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

35 (vi) ORIGINAL SOURCE: 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
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GGAATTCTA6 ACTATTAGGA CGTGGGGCAC ACGGT 
36 
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What is claimed is: 



1. A process for identifying pharmaceutical^ active agents which 
comprises the steps of: 
5 a) simultaneously expressing a plurality of Uniquely Tagged Target 

Agents to fonn a Target Library; 

b) preparing an Agent Candidate Pool; and 

c) testing the Target Library and the Agent Candidate Pool in an assay 
which identifies agents having desired characteristics. 



2. A process for spatially encoding Target libraries which comprises the 
steps of: 

a) depositing a plurality of antibodies, each member being directed against a 
different epitope tag of a Target Library and each epitope tag of the Target Library 
corresponding to at least one antibody grouping, in an array, on a substrate; 

b) blocking the antibody region of the membrane with an inert protein; and 

c) exposing a Target Library to the antibody region. 



3. A spatially Encoded Target Library prepared as in claim 2. 

4. A process for simultaneously expressing a plurality of uniquely 
tagged genes to form a Target Library which comprises engineering each member of 
the plurality with an epitope tag capable of being recognized by a specified 
antibody, prior to simultaneously expressing the plurality. 

5. A Target Library prepared as in claim 4. 

6. A method of screening bead based combinatorial compound libraries 
which comprises the steps of: 

a) expressing a Target Library wherein to each member is affixed a 
common tag and a unique tag; 
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b) exposing a bead based combinatorial compound library to the Target 
library and separating the positive hits by detection of the common tag; 

c) eluting the active proteins off of the beads; and 

d) identification by means of the unique tag. 

5 

7. Pharmaceutical^ active agents identified by the process of claim 1. 
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